Some Theory For Practical Classifier Validation

نویسندگان

  • Eric Bax
  • Ya Le
چکیده

We compare and contrast two approaches to validating a trained classifier while using all in-sample data for training. One is simultaneous validation over an organized set of hypotheses (SVOOSH), the well-known method that began with VC theory. The other is withhold and gap (WAG). WAG withholds a validation set, trains a holdout classifier on the remaining data, uses the validation data to validate that classifier, then adds the rate of disagreement between the holdout classifier and one trained using all in-sample data, which is an upper bound on the difference in error rates. We show that complex hypothesis classes and limited training data can make WAG a favorable alternative.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance

For high dimensional data, if no preprocessing is carried out before inputting patterns to classifiers, the computation required may be too heavy. For example, the number of hidden units of a radial basis function (RBF) neural network can be too large. This is not suitable for some practical applications due to speed and memory constraints. In many cases, some attributes are not relevant to con...

متن کامل

Toward a Computational Theory of Data Acquisition and Truthing

The creation of a pattern classifier requires choosing or creating a model, collecting training data and verifying or “truthing” this data, and then training and testing the classifier. In practice, individual steps in this sequence must be repeated a number of times before the classifier achieves acceptable performance. The majority of the research in computational learning theory addresses th...

متن کامل

Improving Cross-Validation Classifier Selection Accuracy through Meta-Learning

In order to choose from the large number of classification methods available for use, cross-validation error estimates are often employed. We present this cross-validation selection strategy in the framework of meta-learning and show that conceptually, metalearning techniques could provide better classifier selections than traditional cross-validation selection. Using various simulation studies...

متن کامل

Searching for the Origins of Schwab's Deliberative Curriculum Theory in the Thoughts of Aristotle, Dewey and Habermas

The main purpose of this study is exploring the roots and foundations of Schwab’s deliberative theory in curriculum. Therefore, after examining this theory in introduction, its foundations and origins were investigated. According to this, basic assumptions of this theory are practical and quasi practical arts, eclectic arts, commonplace and collective decision. Aristotle’s distinction between i...

متن کامل

Probabilistic Neural Network: Comparison of the Cross-validation Approach and a Fast Heuristic to Choose the Smoothing Parameters

It is well known that the Bayesian Classifier is the optimal classifier provided that p(X|c), the probability density function (PDF) with input attribute X given class c is known for any possible X and c. In practice, however, these PDFs are seldom given. Therefore, the Bayesian Classifier usually serves merely as a theoretical reference during the evaluation of other practical approaches. On t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1510.02676  شماره 

صفحات  -

تاریخ انتشار 2015